Parallel I/o Scheduling and Buffer Management
نویسنده
چکیده
Parallel I/O systems are an integral component of modern high performance systems, providing large secondary storage capacity, and having the potential to alleviate the I/O bottleneck of data intensive applications. In these systems the I/O buffer can be used for two purposes (a) improve I/O parallelism by buffering prefetched blocks and making the load on disks more uniform, and (b) improve I/O latency by caching blocks to avoid repeated disk accesses for the same block. To make best use of available parallelism and locality in I/O accesses, it is necessary to design and implement prefetching and buffer management algorithms that schedule reads intelligently so that the most useful blocks are prefetched into the buffer and the most valuable blocks are retained in the buffer when the need for evictions arises. This dissertation focuses on prefetching and buffer management algorithms for parallel I/O systems. Our aim is to exploit the high parallelism provided by multiple disks by using appropriate buffer management to reduce the average read latency seen by an application. The thesis is that, prefetching and buffer management in parallel I/O systems is fundamentally different from that in systems with a single disk, thereby necessitating new algorithms to handle it. To this end we first present evidence to show the limitations of intuitive algorithms that generalize those for sequential systems to a parallel I/O system, and then design algorithms that have better performance.
منابع مشابه
PC-OPT: Optimal Offline Prefetching and Caching for Parallel I/O Systems
We address the problem of prefetching and caching in a parallel I/O system and present a new algorithm for parallel disk scheduling. Traditional buffer management algorithms that minimize the number of block misses are substantially suboptimal in a parallel I/O system where multiple I/Os can proceed simultaneously. We show that in the offline case, where a priori knowledge of all the requests i...
متن کاملImproving Parallel-Disk Buffer Management using Randomized Writeback
We address the problems of I/O scheduling and buffer management for general reference strings in a parallel I/O system. Using the standard parallel disk model withD disks and a shared I/O buffer of size M , we study the performance of on-line algorithms that use bounded global M block lookahead. We introduce the concept of write-back whereby blocks are dynamically relocated between disks during...
متن کاملScheduling Multiple Flows on Parallel Disks
We examine the problem of scheduling concurrent independent flows on multiple-disk I/O storage systems. Two models are considered: in the shared buffer model the memory buffer is shared among all the disks, while in the partitioned buffer model each flow has a private buffer. For the parallel disk model with d > 1 disks it is shown that the problem of minimizing the schedule length of n > 2 con...
متن کاملAn improved parallel disk scheduling algorithm
We address the problems of prefetching and I/O scheduling for read-once reference strings in a parallel I/O system. Read-once reference strings, in which each block is accessed exactly once, arise naturally in applications like databases and video retrieval. Using the standard parallel disk model with D disks and a shared I/O buffer of size M , we present a novel algorithm, Red-Black Prefetchin...
متن کاملTight Bounds for Prefetching and Buffer Management Algorithms for Parallel I/O Systems
ÐThe I/O performance of applications in multiple-disk systems can be improved by overlapping disk accesses. This requires the use of appropriate prefetching and buffer management algorithms that ensure the most useful blocks are accessed and retained in the buffer. In this paper, we answer several fundamental questions on prefetching and buffer management for distributed-buffer parallel I/O sys...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1999